Unsupervised Learning for Syntactic Disambiguation
نویسندگان
چکیده
منابع مشابه
Unsupervised Learning for Syntactic Disambiguation
We present a methodology framework for syntactic disambiguation in natural language texts. The method takes advantage of an existing manually compiled non-probabilistic and nonlexicalized grammar, and turns it into a probabilistic lexicalized grammar by automatically learning a kind of subcategorization frames or selectional preferences for all words observed in the training corpus. The diction...
متن کاملUnsupervised Learning of Disambiguation Rules for Part of Speech Tagging
In this paper we describe an unsupervised learning algorithm for automatically training a rule-based part of speech tagger without using a manually tagged corpus. We compare this algorithm to the Baum-Welch algorithm, used for unsupervised training of stochastic taggers. Next, we show a method for combining unsupervised and supervised rule-based training algorithms to create a highly accurate t...
متن کاملLearning Probabilistic Subcategorization Preference and its Application to Syntactic Disambiguation
This paper proposes a novel method of learning probabilistic subcategorization preference. In the method, for the purpose of coping with the ambiguities of case dependencies and noun class generalization of argument/adjunct nouns, we introduce a data structure which represents a tuple of independent partial subcategorization frames. Each collocation of a verb and argument/adjunct nouns is assum...
متن کاملUnsupervised Personal Name Disambiguation
This paper presents a set of algorithms for distinguishing personal names with multiple real referents in text, based on little or no supervision. The approach utilizes an unsupervised clustering technique over a rich feature space of biographic facts, which are automatically extracted via a language-independent bootstrapping process. The induced clustering of named entities are then partitione...
متن کاملWord-Sense Disambiguation of Sinhala Language with Unsupervised Learning
-Resolving ambiguity requires little conscious effort in human communications. To make decisions about the intended sense of a word we use our broad understanding of the language and the real-world knowledge. Disambiguation in translations is the selection of the intended sense from a known finite set of possible meanings of an ambiguous word. This choice is based upon a probabilistic model tha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computación y Sistemas
سال: 2014
ISSN: 1405-5546
DOI: 10.13053/cys-18-2-2014-035